Sound Signal Processing Based on Seq2Tree Network

نویسندگان

  • Weicheng Ma
  • Kai Cao
  • Zhaoheng Ni
  • Xiuyan Ni
  • Sang Chin
چکیده

Most state-of-the-art solutions to sound signal processing tasks such as the speech and noise separation task and the music style classification task are based on Recurrent Neural Network (RNN) architecture or Hidden Markov Model (HMM). Both RNN and HMM assume that the input is chain-structured so that each element in the chain is equally dependent on all its previous units. However in real-life scenes the units alone do not carry much meaning. Only when several units group to be segments will they be semantically informative. This characteristic of sound signals clearly prefers emphasizing dependencies among units in the same segment, which leads to a natural selection of tree-structured models instead of chain-structured ones. In this paper we introduce Seq2Tree network and two models based on Seq2Tree architecture solving 1) speech and noise separation task and 2) music style classification task, respectively. Experiments show that our Seq2Tree-based models outperform the state-of-the-art systems in both tasks, which agrees with our hypothesis that sound signals have potential tree-structured dependencies among their sound elements. Also the experiment results prove the advancement of the Seq2Tree network architecture in sound signal processing tasks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sound Signal Processing with Seq2Tree Network

Long Short-Term Memory (LSTM) and its variants have been the standard solution to sequential data processing tasks because of their ability to preserve previous information weighted on distance. This feature provides the LSTM family with additional information in predictions, compared to regular Recurrent Neural Networks (RNNs) and Bag-of-Words (BOW) models. In other words, LSTM networks assume...

متن کامل

Seq2Tree: A Tree-Structured Extension of LSTM Network

Long Short-Term Memory network(LSTM) has attracted much attention on sequence modeling tasks, because of its ability to preserve longer term information in a sequence, compared to ordinary Recurrent Neural Networks(RNN’s). The basic LSTM structure assumes a chain structure of the input sequence. However, audio streams often show a trend of combining phonemes into meaningful units, which could b...

متن کامل

Tree Structured Multimedia Signal Modeling

Current solutions to multimedia modeling tasks feature sequential models and static tree-structured models. Sequential models, especially models based on Bidirectional LSTM (BLSTM) and Multilayer LSTM networks, have been widely applied on video, sound, music and text corpora. Despite their success in achieving state-of-the-art results on several multimedia processing tasks, sequential models al...

متن کامل

Performance of the Wavelet Transform-Neural Network Based Receiver for DPIM in Diffuse Indoor Optical Wireless Links in Presence of Artificial Light Interference

Artificial neural network (ANN) has application in communication engineering in diverse areas such as channel equalization, channel modeling, error control code because of its capability of nonlinear processing, adaptability, and parallel processing. On the other hand, wavelet transform (WT) with both the time and the frequency resolution provides the exact representation of signal in both doma...

متن کامل

Real-time damage detection of bridges using adaptive time-frequency analysis and ANN

Although traditional signal-based structural health monitoring algorithms have been successfully employed for small structures, their application for large and complex bridges has been challenging due to non-stationary signal characteristics with a high level of noise. In this paper, a promising damage detection algorithm is proposed by incorporation of adaptive signal processing and Artificial...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017